Asymptotic Optimality of Sequential Sampling Policies for Bayesian Information Collection
نویسندگان
چکیده
We consider adaptive sequential sampling policies in a Bayesian framework. Under the assumptions that the sampling distribution is from an exponential family and that the number of distinct measurement types is finite, we give sufficient conditions for an adaptive sampling policy to achieve asymptotic optimality. Here, asymptotic optimality is understood to mean that the limit of the expected loss under the given sampling policy as the number of measurements allowed grows to infinity attains the minimum over all possible sampling policies. This property is important because it ensures convergence in the limit for sophisticated policies designed to maximize performance over the short-term. We then apply these sufficient conditions to show asymptotic optimality of three previously proposed ranking and selection policies: OCBA for linear loss, LL(S), and LL(1). We also show how this sufficient condition may be generally applied to a broad class of knowledge-gradient policies.
منابع مشابه
Convergence to Global Optimality with Sequential Bayesian Sampling Policies
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes problems in ranking and selection, reinforcement learning, and continuous global optimization. We give sufficient conditions under which measurement policies achieve asymptotically minimal expected loss. Achieving asymptotically minimal expected l...
متن کاملConsistency of Sequential Bayesian Sampling Policies
We consider Bayesian information collection, in which a measurement policy collects information to support a future decision. This framework includes ranking and selection, continuous global optimization, and many other problems in sequential experimental design. We give a sufficient condition under which measurement policies sample each measurement type infinitely often, ensuring consistency, ...
متن کاملAsymptotic properties of the sample mean in adaptive sequential sampling with multiple selection criteria
We extend the method of adaptive two-stage sequential sampling toinclude designs where there is more than one criteria is used indeciding on the allocation of additional sampling effort. Thesecriteria, or conditions, can be a measure of the targetpopulation, or a measure of some related population. We developMurthy estimator for the design that is unbiased estimators fort...
متن کاملMultihypothesis sequential probability ratio tests - Part I: Asymptotic optimality
The problem of sequential testing of multiple hypotheses is considered, and two candidate sequential test procedures are studied. Both tests are multihypothesis versions of the binary sequential probability ratio test (SPRT), and are referred to as MSPRT’s. The first test is motivated by Bayesian optimality arguments, while the second corresponds to a generalized likelihood ratio test. It is sh...
متن کاملRanking and Selection as Stochastic Control
Under a Bayesian framework, we formulate the fully sequential sampling and selection decision in statistical ranking and selection as a stochastic control problem, and derive the associated Bellman equation. Using value function approximation, we derive an approximately optimal allocation policy. We show that this policy is not only computationally efficient but also possesses both one-step-ahe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008